An Extended ID3 Decision Tree Algorithm for Spatial Data Imas
Authors
Abstract
Utilizing data mining tasks such as classification on spatial data is more complex than on non-spatial data. This is because spatial data mining algorithms have to consider not only the objects of interest themselves but also the neighbours of those objects in order to extract useful and interesting patterns. One of the classification algorithms, namely the ID3 algorithm, which was originally designed for non-spatial datasets, has been improved by other researchers in previous work to construct a spatial decision tree from a spatial dataset containing polygon features only. The objective of this paper is to propose a new spatial decision tree algorithm based on the ID3 algorithm for discrete features represented as points, lines and polygons. As in the ID3 algorithm, which uses information gain in attribute selection, the proposed algorithm uses spatial information gain to choose the best splitting layer from a set of explanatory layers. A new formula for spatial information gain is proposed using spatial measures for point, line and polygon features. Empirical results demonstrate that the proposed algorithm can be used to join two spatial objects in constructing spatial decision trees on a small spatial dataset. The proposed algorithm has been applied to a real spatial dataset consisting of point and polygon features. The result is a spatial decision tree with 138 leaves and an accuracy of 74.72%.
Keywords: ID3 algorithm, spatial decision tree, spatial information gain, spatial relation, spatial measure
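The attribute selection step described in the abstract can be illustrated with a minimal Python sketch. It computes the classic ID3 information gain and, as an assumption made here for illustration only, a measure-weighted variant in which each record contributes its spatial measure (for example a polygon area) instead of a count of one. The abstract does not give the paper's actual spatial information gain formula, so the weighting scheme, the function names, and the sample field names (`land_cover`, `soil`, `target`, `area`) are all hypothetical.

```python
import math
from collections import defaultdict


def entropy(weights_by_class):
    """Shannon entropy in bits of a class distribution given as {class: weight}."""
    total = sum(weights_by_class.values())
    if total == 0:
        return 0.0
    return -sum((w / total) * math.log2(w / total)
                for w in weights_by_class.values() if w > 0)


def information_gain(records, attribute, target, measure=None):
    """Information gain of splitting `records` on `attribute`.

    records: list of dicts, one per object.
    measure: optional key holding a spatial measure (assumption: used as the
             record weight). If None, every record weighs 1, which is the
             classic ID3 information gain.
    """
    overall = defaultdict(float)
    partitions = defaultdict(lambda: defaultdict(float))
    for record in records:
        weight = 1.0 if measure is None else float(record[measure])
        overall[record[target]] += weight
        partitions[record[attribute]][record[target]] += weight

    total = sum(overall.values())
    if total == 0:
        return 0.0
    # Expected entropy after the split, weighted by each partition's share.
    remainder = sum(sum(part.values()) / total * entropy(part)
                    for part in partitions.values())
    return entropy(overall) - remainder


# Hypothetical toy data: three objects with an illustrative area measure.
records = [
    {"land_cover": "forest", "soil": "peat", "target": "yes", "area": 3.0},
    {"land_cover": "forest", "soil": "peat", "target": "yes", "area": 1.0},
    {"land_cover": "water", "soil": "peat", "target": "no", "area": 4.0},
]

count_gain = information_gain(records, "land_cover", "target")
area_gain = information_gain(records, "land_cover", "target", measure="area")
print(count_gain, area_gain)
```

In this toy data `land_cover` separates the classes perfectly, so its gain equals the entropy of the target, while the constant `soil` attribute yields a gain of zero. Weighting by `area` changes the class distribution from 2:1 to 4:4, which is why the two gains differ.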
Similar resources
Classification Model for Hotspot Occurrences Using Spatial Decision Tree Algorithm
Developing a predictive model for forest fire occurrence is an important activity in a fire prevention program. The model describes the characteristics of areas where fires occur based on past fire data. It is essential as an early warning system for preventing forest fires, so that major fire damage can be avoided. This study describes the application of a data mining technique, namely de...
Evaluation of liquefaction potential based on CPT results using C4.5 decision tree
The prediction of liquefaction potential of soil due to an earthquake is an essential task in Civil Engineering. The decision tree is a tree structure consisting of internal and terminal nodes which process the data to ultimately yield a classification. C4.5 is a known algorithm widely used to design decision trees. In this algorithm, a pruning process is carried out to solve the problem of the...
A New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining
Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...
Ensemble Classification and Extended Feature Selection for Credit Card Fraud Detection
Due to the rise of technology, the possibility of fraud in areas such as banking has increased. Credit card fraud is a crucial problem in banking and its danger is ever increasing. This paper proposes an advanced data mining method that considers both feature selection and decision cost to enhance the accuracy of credit card fraud detection. After selecting the best and most effec...
Steel Buildings Damage Classification by damage spectrum and Decision Tree Algorithm
Results of damage prediction in buildings can be used as a useful tool for managing and decreasing seismic risk of earthquakes. In this study, damage spectrum and C4.5 decision tree algorithm were utilized for damage prediction in steel buildings during earthquakes. In order to prepare the damage spectrum, steel buildings were modeled as a single-degree-of-freedom (SDOF) system and time-history...